Compactness of the space of non-randomized policies in countable-state sequential decision processes

نویسندگان

  • Richard C. Chen
  • Eugene A. Feinberg
چکیده

For sequential decision processes with countable state spaces, we prove compactness of the set of strategic measures corresponding to nonrandomized policies. For the Borel state case, this set may not be compact [14, p. 170] in spite of compactness of the set of strategic measures corresponding to all policies [17,2]. We use the compactness result from this paper to show the existence of optimal policies for countable-state constrained optimization of expected discounted and nonpositive rewards, when the optimality is considered within the class of nonrandomized policies. This paper also studies the convergence of a value-iteration algorithm for such constrained problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

COUNTABLE COMPACTNESS AND THE LINDEL¨OF PROPERTY OF L-FUZZY SETS

In this paper, countable compactness and the Lindel¨of propertyare defined for L-fuzzy sets, where L is a complete de Morgan algebra. Theydon’t rely on the structure of the basis lattice L and no distributivity is requiredin L. A fuzzy compact L-set is countably compact and has the Lindel¨ofproperty. An L-set having the Lindel¨of property is countably compact if andonly if it is fuzzy compact. ...

متن کامل

One-point extensions of locally compact paracompact spaces

A space $Y$ is called an {em extension} of a space $X$, if $Y$ contains $X$ as a dense subspace. Two extensions of $X$ are said to be {em equivalent}, if there is a homeomorphism between them which fixes $X$ point-wise. For two (equivalence classes of) extensions $Y$ and $Y'$ of $X$ let $Yleq Y'$, if there is a continuous function of $Y'$ into $Y$ which fixes $X$ point-wise. An extension $Y$ ...

متن کامل

Denumerable Constrained Markov Decision Processes and Finite Approximations

The purpose of this paper is two fold. First to establish the Theory of discounted constrained Markov Decision Processes with a countable state and action spaces with general multi-chain structure. Second, to introduce nite approximation methods. We deene the occupation measures and obtain properties of the set of all achievable occupation measures under the diierent admissible policies. We est...

متن کامل

Denumerable Constrained Markov Decision Problems and Finite Approximations Denumerable Constrained Markov Decision Problems and Finite Approximations

The purpose of this paper is two fold. First to establish the Theory of discounted constrained Markov Decision Processes with a countable state and action spaces with general multi-chain structure. Second, to introduce nite approximation methods. We deene the occupation measures and obtain properties of the set of all achievable occupation measures under the diierent admissible policies. We est...

متن کامل

Utilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs

Multi agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for Multi agent Reinforcement Learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDP is proposed. In the proposed algorithm, MMDP ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Math. Meth. of OR

دوره 71  شماره 

صفحات  -

تاریخ انتشار 2010